Gravitational Wave Confusion Noise 
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One of the greatest challenges facing gravitational wave astronomy in the low frequency band 
is the confusion noise generated by the vast numbers of unresolved galactic and extra galactic 
binary systems. Estimates of the binary confusion noise suffer from several sources of astrophysical 
uncertainty, such as the form of the initial mass function and the star formation rate. There is 
also considerable uncertainty about what defines the confusion limit. Various ad-hoc rules have 
been proposed, such as the one source per bin rule, and the one source per three bin rule. Here 
information theoretic methods are used to derive a more realistic estimate for the confusion limit. 
It is found that the gravitational wave background becomes unresolvable when there is, on average, 
more than one source per eight frequency bins. This raises the best estimate for the frequency at 
which galactic binaries become a source of noise from 1.45 mHz to 2.54 mHz. 



Population synthesis models and astronomical obser- 
vations suggest that there may be as many as 2 x 10 8 
galactic disk binaries containing two compact objects. 
Several million of these binaries arc expected to emit 
gravitational waves that can be detected by the Laser 
Interferometer Space Antenna (LISA). At frequencies be- 
low ~ 1 mHz these systems constitute an embarrass- 
ment of riches for gravitational wave astronomy as the 
number of sources is so great that it becomes impossible 
to isolated individual binaries. The unresolved galactic 
gravitational wave background is a source of noise that 
degrades LISA's ability to detect other sources, such as 
extra-galactic supermassive black hole binaries. 

A much quoted definition of the gravitational wave 
confusion noise level is the amplitude at which there 
is, on average, at least one source per frequency bin. 
Even this crude definition takes into account some data 
analysis considerations as the width of a frequency bin, 
A/ = 1/T, depends on the observation time T. The 
longer the observation, the smaller the confusion noise. 
However, Hellings [lj pointed out that there was not 
enough information in one frequency bin to fully spec- 
ify a binary system. A monochromatic, circular binary is 
described by the seven parameters (/, In A, 9, <fi, i, ip, (po), 
where / is the gravitational wave frequency, A is the 
intrinsic amplitude, (9, <fi) describe the sky location, i de- 
notes the inclination, ip is the polarization angle and tpo is 
the orbital phase. Hellings reasoned that each frequency 
bin only contains two pieces of information - the real and 
imaginary parts of the strain spectral density - while six 
pieces of information are needed to fix (In A, 9, <j>, i, ip, <fo)- 
Thus, at least three frequency bins are required to solve 
for the six parameters. The frequency was not included 
in the information budget on the grounds that it came 
for free from the Fourier transform. While the three bin 
rule is an improvement over the naive one bin rule, it 
lacks a rigorous justification. More recently, Phinney Q 
has revisited the confusion noise question by using Shan- 
non information theory to estimate the total number of 
binaries than can be resolved by LISA. Here a similar ap- 
proach is used to estimate the number of frequency bins 
that are needed to resolve a binary system. 

The estimate is made by comparing the information 



contained in one frequency bin with the amount of infor- 
mation required to record the parameters that describe 
a binary system. Both of these quantities depend on the 
signal to noise ratio of the instrument. A higher signal 
to noise ratio means more information in each bin, but 
it also means that the binary parameters are better de- 
termined, and thus take more information to describe. 

The Baud rate of the detector, C, depends on the 
bandwidth of the signal, B, and the signal to noise ratio 
S/N according to 



C = Blog 2 [l 
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The total number of bits transmitted during an observa- 
tion time T is simply CT. Setting the bandwidth equal to 
Af = 1/T, it follows that the amount of information per 
bin is equal to AI — log 2 (l + 4?). Because the LISA 
observatory is designed to operate in stereo, i.e. LISA 
can detect both gravitational wave polarizations |3|], the 
total amount of information available in each frequency 
bin is larger than AI, but not a full factor of two larger as 
the information in each channel is not fully independent. 
The analysis of the stereo performance is further com- 
plicated by the correlated noise in the two channels. In 
what follows, the information budget is calculated for a 
single channel, with the expectation that the result would 
be little changed by including the second channel. 

The amount of information required to record the pa- 
rameters of a binary system can be estimated by com- 
paring the volume of the error ellipsoid to the volume 
of the parameter space. To see how this works, consider 
the following one dimensional example. Suppose that the 
angle <fr can be measured to an accuracy of three degrees. 
Recording to this accuracy takes log 2 (360/3) w 7 bits 
of data. The volume of the seven dimensional error ellip- 
soid that characterizes the binary subtraction problem 
for LISA can be estimated by calculating the determi- 
nant of the Fisher information matrix Ty. To calculate 
the Fisher matrix an expression is needed that describes 
how information about the binary is encoded in the the 
detector output. At low frequencies, the signal in each 
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LISA channel takes the form 



8(t) = F+{t,iP,e,(f>)A + (A,i)cosy(tJ 1 9,^cp Q ) 

+F X (t, V, 0, 0)A X (^4, t) sin /, 0, 0, ^o) ,(2) 

tf (t, /, 0, 0, p„) = 27T/i + $ D (t, /, 0, 0) + • (3) 



where 



Expressions for the detector response functions F + and 
F x , the Doppler modulation and the gravitational 
wave amplitudes A + and A x can be found in Ref. Q. 
The components of the Fisher matrix are found using the 
expression Q 
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diS a (t)djS a (t) dt , 



(4) 



where the components i, j range over the seven param- 
eters A — * (/, hi A, 9, 4>, i, t/j, ipo), and S n (f) is the one- 
sided noise spectral density. In general, the Fisher matrix 
is a complicated function of all seven parameters, which 
implies that the volume of the error ellipsoid varies de- 
pending on where the binary sits in parameter space. 
Assuming that the noise is Gaussian, and working in the 
limit where the signal is large compared to the noise, the 
volume of the seven dimensional error ellipsoid is give by 



AV(A) = 



7T 3 V2 
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det(C„(A)) : 



(5) 



where Ci. 



is the noise covariance matrix, Ci. 



(AAjAAj). The amount of information needed to de- 
scribe or subtract a binary source emitting gravitational 
waves with frequency / is given by 



1(B, A) = log, 



( VjB) \ 
\AV(X) 



(6) 



where V(B) is the volume of the parameter space in a 
bandwidth B. The bandwidth needed to subtract a typi- 
cal binary can be estimated by solving the transcendental 
equation 



HB) =BT\n : , 2 ( I 



(7) 



for B. Here T(B) is the average amount of information 
required to remove one binary, and S/N 3> 1 is the av- 
erage signal to noise ratio across the bandwidth. Note 
that the left hand side of the transcendental equation 
depends logarithmically on the bandwidth and the obser- 
vation time, while the right hand side depends linearly 
on these quantities. 

Calculating T(B, A) is a straightforward, yet computa- 
tionally intensive task. The volume of the error ellipsoid 
can be expressed in terms of the eigenvalues erf of the 
covariance matrix C;, : 



AV(A) 



'lJ.fi (A) ■ 
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(8) 



The eigenvectors of the covariance matrix lie along the 
principle axes of the error ellipsoid, which may not be 
aligned with the coordinate axes. The Fisher matrix can 
be expressed in the form 



r<J ' (X) = (I) 7«(/.M,»,V0 

where the signal strength is given by 



S 2 = I s(t) 2 dt 
la 



(9) 



(10) 



and 

Hi : 



f \d i C + d j C x + C x (diC+djV + 0,-C+fij*) 
Jo L 

+d 3 C+d t C x - C+ {diCxdjV + djCxdiV) 



(Cl + C x ) diVdjV 



dt 



{C 2 + + C 2 x )dt 



(11) 



Here C+(t) = F+(t)A+ and C x (t) = F x (t)A x . The am- 
plitude modulation functions C+{t) and C x (t) depend on 
(ln„4, i, tj), 9, <f>) but are independent of / and <po, while 
the phase function ^(t) depends on (/, 0, y> ) but is 
independent of In A, i and ip. In other words, the ampli- 
tude modulation terms are solely responsible for deter- 
mining the intrinsic amplitude, inclination and polariza- 
tion, while the phase term is solely responsible for deter- 
mining the frequency and orbital phase. Both the am- 
plitude and phase play a role in fixing the sky location 
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FIG. 1: A histogram of the quantity log 10 (vol(A)). 

Using the expressions derived in Ref. Q it is a simple, 
yet laborious task to compute jij(f, 9, <f>, i, i/j). The final 
expressions are long and rather opaque. Most striking is 
the strong dependence on {f,9,<p,i,tp), which makes it 
exceedingly difficult to give simple answers to questions 
such as "what is the angular resolution of the LISA detec- 
tor?" . One trend that is obvious is the improvement in 
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the information encoding at high frequencies that comes 
from the Doppler modulation. Thus, to obtain a lower 
limit on the bandpass required to remove a source from 
the data stream one should work in the very low fre- 
quency limit (which is equivalent to setting <&£>(£) = 0). 
Even then 7^ shows a strong dependence on {9, cj>, 1, -0) 
that defies efforts to define "typical" parameter sensitiv- 
ities. Using JSJ and © we have 



Ar,A,= (^] vol(A). 



where 



vol(A) 



Tr 3 72 
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det( 7 „(A)) 



(12) 



(13) 



When calculating 7y the dimensionless quantity q = fT 
is used instead of /. Physically, q is the number of wave 
cycles in the observation period. In terms of the coor- 
dinates (q, hi A, 9, 4>, 1, tp, ip®), the volume V(B) is equal 
to 



V{B) = 47r 3 5T(Aln„4) ~ 2857V b 



(14) 



where the final expression uses the fact that A is typically 
within a factor of ~ 10 of the fixed signal strength S, 
and Nb = BT is the number of frequency bins covered 
by the bandwidth B. Combing equations 10, (|12|) and 
(fH|) yields 



5* 



Z(B,A) = 71og 2 ( - 



log 2 (285iV b )-log 2 (vol(A)). 

(15) 

While the quantity T(B, A) does vary with the binary's 
location in parameter space A, the variations are tamed 
by the logarithmic dependence on vol(A). Figurc^shows 
a histogram of the quantity log 10 (vol(A) in the very low 
frequency limit (/ ~ 0.1 mHz) for T = lyr of observa- 
tions. The histogram was generated by sampling (9, (j>) 
on a uniform HEALPIX grid [5| with 49152 pixels, and 
by sampling both 1 and ip uniformly at 200 points, mak- 
ing for a grand total of 1.97 x 10 9 samples. The quantity 
log 10 (vol(A)) was found to have a median value of 1.4 and 
a mean value of 2.3. Thus, the average value of T{B, A) 
is equal to 



Z(B) = 71og 2 



log 2 (1.43AT 6 ) 



Solving the transcendental equation Q yields 

]og a (10) 



log 2 (W 



(16) 



(17) 



Thus, for typical sources, Nb — 8. This result is in strik- 
ing disagreement with the recent work of Krolack and 
Tinto |8j, where a Fisher matrix based approach was 
used to derive the result Nb — 0.5. They argue that 
the number of sources than can be removed scales as 
V(B)/ AV(X), which is precisely opposite to the reason- 
ing used to derive Eqn. JBJ. 

It has been estimated 6] that the number of detached 
wd-wd binaries per 1/yr frequency bin scales as 



A^bin = 



1.445 mHz 



-11/3 



(18) 



Setting iVbin = 1/8 yields / = 2.54 mHz for the fre- 
quency cutoff below which the galactic population of wd- 
wd binaries becomes a confusion noise source for LISA. 
Increasing the limit from 1.445 mHz to 2.54 mHz may 
not sound like much, but it translates into a significant 
reduction in the number of galactic binaries LISA can 
resolve. 

The "eight bin rule" derived above is likely a fairly ro- 
bust estimate. Mostly it comes from the overall (N/S) 7 
factor that scales the size of the seven dimensional error 
ellipsoid. The estimate is fairly insensitive to how 7^ 
is calculated, so including the Doppler modulation will 
only slightly increase Nb- As argued earlier, utilizing the 
full stereo capabilities of LISA would reduce Nb, but by 
less than a factor of two. In Ref. @ a single channel was 
used to test an algorithm for identifying and subtracting 
binary systems from the LISA data stream. It was found 
that the subtraction worked for 3 sources separated by 
an average of 5 bins. This appears to contradict the eight 
bin rule, but information from surrounding bins was also 
used. The number of bins with S/N > 5 that contributed 
to the subtraction totaled ~ 40, so approximately 13 bins 
were used to remove each signal, and even then the sub- 
traction was imperfect. One of the key questions to be 
addressed in future work is how close can one get to the 
information theory limit. At least now we have a better 
idea of what that limit is. 
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